Characterizing Classes of Potential Outliers through Traffic Data Set Data Signature 2D nMDS Projection

Authors

  • Erlo Robert F. Oquendo
  • Jhoirene B. Clemente
  • Jasmine A. Malinao
  • Henry N. Adorna
Abstract

This paper presents a formal method for characterizing the potential outliers in the data signature projection of a traffic data set, visualized using Non-Metric Multidimensional Scaling (nMDS). Previous work relied only on visual inspection, and the subjective nature of that technique may yield false or invalid potential outliers. Identifying the correct potential outliers has been posed as an open problem in the literature. It matters because these outliers pinpoint the areas and time frames where traffic incidents or accidents occur along the North Luzon Expressway (NLEX) in Luzon. In this paper, potential outliers are classified into (1) absolute potential outliers; (2) valid potential outliers; and (3) ambiguous potential outliers through the use of confidence bands and confidence ellipses. A method is also described to validate the cluster membership of identified ambiguous potential outliers. Using the 2006 NLEX Balintawak Northbound (BLK-NB) data set, we were able to identify two absolute potential outliers, nine valid potential outliers, and five ambiguous potential outliers. Earlier work that used Vector Fusion identified 10 potential outliers. The nMDS visualization with confidence bands and confidence ellipses recovered all 10 of these and found 8 new potential outliers.
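The abstract does not say how the confidence ellipse is constructed, so the following is only a minimal sketch of one common way to test 2D projected points against such an ellipse. It assumes the projected points are roughly bivariate normal, uses the squared Mahalanobis distance with a chi-square threshold (2 degrees of freedom), and assumes a 95% level. The function name `outside_confidence_ellipse` and the `level` parameter are hypothetical and are not taken from the paper.

```python
import math

def outside_confidence_ellipse(points, level=0.95):
    """Flag 2D points lying outside the sample's confidence ellipse.

    Assumption (not from the paper): the ellipse is the set of points whose
    squared Mahalanobis distance from the sample mean is at most the
    chi-square quantile with 2 degrees of freedom at the given level.
    """
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    # Entries of the unbiased sample covariance matrix (divide by n - 1).
    sxx = sum((x - mx) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - my) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in points) / (n - 1)
    det = sxx * syy - sxy ** 2
    if det <= 0:
        raise ValueError("covariance matrix is singular")
    # For 2 degrees of freedom the chi-square quantile has a closed form.
    threshold = -2.0 * math.log(1.0 - level)
    flags = []
    for x, y in points:
        dx, dy = x - mx, y - my
        # Squared Mahalanobis distance via the inverse of the 2x2 covariance.
        d2 = (syy * dx * dx - 2.0 * sxy * dx * dy + sxx * dy * dy) / det
        flags.append(d2 > threshold)
    return flags
```

Because the mean and covariance here are estimated from all points, including any outliers, a single extreme point can inflate the ellipse; a robust estimate may be preferable in practice.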

Similar resources

Data Signature-Based Time Series Traffic Analysis on Coarse-Grained NLEX Density Data Set

In this study, we characterized traffic density modeled from coarse data by using data signatures to effectively and efficiently represent traffic flow behavior. Using the 2006 North Luzon Expressway North Bound (NLEX NB) Balintawak (Blk), Bocaue (Boc), Meycauayan (Mcy), and Marilao (Mrl) segments' hourly traffic volume and time mean speed data sets provided by the National Center for Transport...

A statistical test for outlier identification in data envelopment analysis

In the use of peer group data to assess individual, typical or best practice performance, the effective detection of outliers is critical for achieving useful results. In these "deterministic" frontier models, statistical theory is now mostly available. This paper deals with the statistical pared sample method and its capability of detecting outliers in data envelopment analysis. In the prese...

Hyperspectral Image Classification Based on the Fusion of the Features Generated by Sparse Representation Methods, Linear and Non-linear Transformations

The ability to record the high-resolution spectral signature of the earth's surface is arguably the most important feature of hyperspectral sensors. Classification of hyperspectral imagery is one of the methods for extracting information from these remote sensing data sources. Despite the high potential of hyperspectral images in terms of information content, there...

Who Should be Interviewed? A Response from Cluster Analysis

Objective: This article presents an application of cluster analysis for social science research, especially studies that include interviews as part of their data collection. This application is most suitable for sequential mixed-method researchers who use quantitative data to frame subsequent qualitative subsamples for conducting interviews.  Methods: In more detail, the algorithm (i....

Nursing Minimum Data Set: An Essential Need for the Health Care System in Iran

Background & Aim: Nurses are the largest group in the health care delivery system. Nursing Information Systems (NIS) are important for improving nursing performance, increasing nursing knowledge, and providing the data and information needed for nursing. Identifying a Nursing Minimum Data Set (NMDS) is the first step in the development of an NIS. Considering the absence of an NMDS in Iran, this study was conduc...

Journal:
  • CoRR

Volume abs/1702.07501  Issue

Pages  -

Publication date 2010